Collecting and Organizing Web Content

نویسندگان

  • Mira Dontcheva
  • Steven Drucker
  • Geraldine Wade
  • David Salesin
  • Michael Cohen
چکیده

To collect and organize Web content today a user must make bookmarks, print whole webpages, or copy and paste pieces of webpages into a document. We present a framework for assisting the user in managing personal collections of Web content. The user interactively selects the webpage elements of interest, and the system builds an extraction pattern for those elements that is used to automatically collect content from analogous pages. Every new page the user visits is matched with existing patterns, and if matching page elements are found, the user can add them to the summary database. Layout templates filter the database to compose summary views. The user can view the database contents at any time with any layout template.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interaction Techniques for Automating Collecting and Organizing Personal Web Content

Interaction Techniques for Automating Collecting and Organizing Personal Web Content Lubomira A. Dontcheva Co-Chairs of the Supervisory Committee: Affiliate Faculty Michael F. Cohen Department of Computer Science & Engineering Professor David H. Salesin Department of Computer Science & Engineering The growth of the World Wide Web has led to a dramatic increase in accessible information. Today, ...

متن کامل

Developments in Practice VIII: Enterprise Content Management

Enterprise content management (ECM) is an integrated approach to managing all of an organization’s information including paper documents, data, reports, web pages, and digital assets. ECM includes the strategies, tools, processes, and skills an organization needs to manage its information assets over their lifecycle. While many vendors would suggest that their software is a panacea, most knowle...

متن کامل

Experiences with Content Extraction from the Web

We present the results of a ten-week field study that explored the use of automatic Web tools for collecting and organizing Web content in the context of users’ personal tasks. Our findings show that people welcome automatic gathering of structured information, such as job or rental listings, and are eager to use rich visualizations and displays of content they find on the Web. We also found th...

متن کامل

Collaborative Refinery: A Collaborative Information Workspace for the World Wide Web

The conceptual framework of a new system, Collaborative Refinery, is motivated by a scenario involving the creation of an FAQ. The scenario introduces the concepts of collecting, culling, organizing and distilling. Distilling is a specialized form of collaborative authoring with support for content selection and genre. The Web-based user interface supporting access to the four conceptual functi...

متن کامل

Pinterest Analysis and Recommendations

Pinterest is a visual discovery tool for collecting and organizing content on the Web with over 70 million users. Users “pin” images, videos, articles, products, and other objects they find on the Web, and organize them into boards by topic. Other users can repin these and also follow other users or boards. Each user organizes things differently, and this produces a vast amount of human-curated...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006